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Abstract 

This paper gives a technically elementary treatment of some aspects of Hamilton- 
Jacobi theory especially in relation to the calculus of variations. The second half 
of the paper describes the application to geometric optics, the optico-mechanical 
analogy and the transition to quantum mechanics. Finally, I report recent work 
of Holland providing a Hamiltonian formulation of the pilot-wave theory. 
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'Dont worry, young man: in mathematics, none of us really understands any idea — we 
just get used to them. ' 

John von Neumann, after explaining (no doubt very quickly!) the method of character- 
istics (i.e. Hamilton-Jacobi theory) to a young physicist, as a way to solve his problem; 
to which the physicist had replied 'Thank you very much; but I'm afraid I still don't 
understand this method.' 
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1 Introduction 



In the eighty years since its discovery in the mid-1920s, quantum mechanics has gone 
from strength to strength. It has repeatedly been proved successful, to a high degree 
of accuracy, in domains of application very different from its original one. For exam- 
ple, although it was devised for systems of atomic dimensions (10~ 8 cm), it has since 
proven accurate for scales much smaller (cf. the nuclear radius of ca. lCT 12 cm) and 
vastly larger (cf. superconductivity and superfluidity, involving scales up to 10 _1 cm). 
Similarly, if we think of domains of application, not as length (or energy) scales, but 
as types of "stuff" to which the theory applies. Though quantum mechanics was first 
devised to apply to matter (i.e. electrons and protons, the more "obvious" constituents 
of atoms), it was soon extended to fields, i.e. the electromagnetic field: indeed, matter 
soon became regarded as excitations in associated fields. Similarly, if we think of do- 
mains of application as types of force: though first devised for electromagnetic forces, 
quantum mechanics now successfully describes the weak and strong forces. Indeed, 
similarly for 'domains' understood naively, as regions of the universe: quantum me- 
chanics has also been applied with great success to astronomy — the obvious examples 
being the use of nuclear physics in theories of stellar structure and evolution, and of 
particle physics in theories of the early universe. 

So quantum mechanics has been an amazing success story. I stress this point 
at the outset, for two reasons. First: it is, unfortunately, all too easy to get used 
to success. Nowadays, both physicists, for whom the various quantum theories have 
become everyday professional tools, and the wider scientifically literate public, can 
easily lose their sense of wonder at this immense success. So it is worth remembering 
how contingent, and surprising, it is. 

My second reason is more specific to work in the foundations and-or philosophy 
of quantum theory. This work focusses on the interpretative problems, especially the 
measurement problem, that still confront quantum mechanics, despite its immense 
empirical success: hence this volume's question 1 Quo vadis, quantum mechanics?' Of 
course, I endorse that focus: it is crucially important to address these problems. But in 
addressing them, it is salutary to recall this success, as an intellectual backdrop. Indeed, 
not only is it salutary: it might also be heuristically useful — though of course, differ- 
ent researchers, with their different intellectual temperaments, will take this success 
to give different heuristic clues about 'Quo vadis, quantum mechanics?'. For example, 
an Everettian philosopher such as Saunders (??this volume) may see the success of 
the established quantum theoretic formalisms as supporting their position: certainly, 
heterodox quantum theories such as dynamical models of wave-function collapse face 
an enormous task in recovering that success. On the other hand, a theoretical physi- 
cist who is searching for a successor to quantum mechanics — whether to solve these 
interpretative problems or to reconcile the quantum with general relativity's treatment 
of gravitation, or both (such as 't Hooft, ??this volume) — may scrutinize the details of 
this empirical success for clues about how present-day quantum mechanics might be 
an effective, i.e. phenomenological, theory. As 't Hooft wittily puts it: we can ask, 



not ''Quo vadis, quantum mechanics?', but rather ' Unde venis? — 'Where do you come 
from?' 

This paper will likewise ask 'Unde venis, quantum mechanics?': though I humbly 
admit that I will interpret this question in a retrospective and expository sense, rather 
than in 't Hooft's wonderfully forward-looking and creative sense. To be specific: I 
propose to discuss Hamilton-Jacobi theory as a classical root of quantum mechanics. 

One part of this story is well known to physicists and philosophers and historians 
of physics. Namely: Hamilton-Jacobi theory as a method of integrating Hamilton's 
equations (using Jacobi's theorem, action-angle variables etc.), and the use made of 
this integration theory in nineteenth century celestial mechanics, and thereby in the 
old quantum theory. 

There is however another part of this story that seems much less known by this 
community: viz. Hamilton-Jacobi theory understood from the perspective of the cal- 
culus of variations (as worked out by such masters as Hilbert and Caratheodory), and 
how this understanding motivates deBroglie's and Schrodinger's proposal to extend 
Hamilton's optico-mechanical analogy, thus creating quantum mechanics (as wave me- 
chanics) . So I propose to present this part of the story: or rather, since this part could 
fill a book — selected pieces of it! (My (2003, 2003a) discuss some other, philosophical, 
aspects.) At the end of the paper, I shall also briefly return to 1 Quo vadis?\ i.e. to a 
current interest in the foundations of quantum theory: viz. the pilot-wave theory — on 
which Hamilton-Jacobi theory casts some light. But I begin, in the next Subsection, 
with a more detailed prospectus. 

1.1 Introducing Hamilton-Jacobi theory 

Hamilton-Jacobi theory is a general theory, rich in analytic and geometric ideas, that 
unifies three apparently disparate topics: systems of first order ordinary differential 
equations, first order partial differential equations, and the calculus of variations. 
Roughly speaking, Hamilton-Jacobi theory shows that the following problems are 
equivalent: — 

(ODE): solving a canonical system of first order ordinary differential equations (2n 
equations for 2n functions of a parameter t in which all variables' first derivatives are 
given by partial derivatives of one and the same function); e.g. Hamilton's equations 
in Hamiltonian mechanics. 

(PDE): solving a first order partial differential equation in which the unknown func- 
tion does not occur explicitly; e.g. the Hamilton-Jacobi equation in mechanics. 

(CV): solving the "basic" calculus of variations problem of finding n functions 
qi, . . . , q n of a parameter t that make stationary a line-integral of the form J L(qi,qi, t) dt, 
where the dot denotes differentiation with respect to t; e.g. Hamilton's principle in 
Lagrangian mechanics, or Fermat's principle in geometric optics. 

A bit more precisely: elementary Lagrangian and Hamiltonian mechanics show 
(ODE) and (CV) to be equivalent for the case of fixed end-points. Hamilton-Jacobi 
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theory extends this equivalence by considering, not a single solution of the canonical 
equations (a single line-integral) but a whole field of solutions, i.e. line-integrals along 
all the curves of a space-filling congruence (so that the end-points lie on hypersurfaces 
transverse to the congruence). The initial conditions of a problem then become the 
specification of a function's values on such a hypersurface, instead of an initial config- 
uration and momentum (or an initial and final configuration): hence the occurrence of 
partial differential equations. 

The main aim of this paper is to explain (in an elementary way) these equivalences 
and some related results. This explanation will later (Sections [7| and |SJ) provide us with 
a perspective on the optico-mechanical analogy and quantum mechanics (specifically, 
wave mechanics). But there is also a pedagogic rationale for presenting these results. 
Most physicists learn Hamilton- Jacobi theory only as part of analytical mechanics; 
and almost all the mechanics textbooks present, in addition to the equivalence of 
(ODE) and (CV) for fixed end-points, only the use of Hamilton- Jacobi theory as a 
method of integrating Hamilton's equations — indeed rendering the integration trivial. 
The central result here is Jacobi's theorem: that given a complete integral of the 
Hamilton- Jacobi equation (typically found by separation of variables), one can obtain 
solutions of Hamilton's equations just by differentiation. This is a remarkable result, 
which lies at the centre of a beautiful geometric theory of the integration of first order 
partial differential equations: a theory which reduces the integration problem to that 
of integrating a suitable system of ordinary differential equations (the characteristic 
equations). But almost all the mechanics textbooks present Jacobi's theorem using 
just canonical transformation theory: as a result, they do not describe this general 
integration theory — and more generally, they do not show the role of geometric ideas, 
nor of the calculus of variations with variable end-points. 

This textbook tradition is of course understandable. Textbooks must emphasise 
problem-solving; and the use of a complete integral of the Hamilton- Jacobi equation to 
solve Hamilton's equations, is crucially important, for several reasons. As to problem- 
solving, it is 'the most powerful method known for exact integration, and many prob- 
lems which were solved by Jacobi cannot be solved by other means' (Arnold 1989, p. 
261). Besides, it is conceptually important: it leads on to action-angle variables, which 
are central both to classical mechanics (e.g. in the Liouville-Arnold theorem, and in 
perturbation theory) and the old quantum theory. 

But though understandable, this tradition is also regrettable. For the result is that 
most physicists understand well only the equivalence of (ODE) and (CV) for fixed 
end-points, and a part of the equivalence of (PDE) and (ODE) — the part expressed 
by Jacobi's theorem. Besides, they understand these matters only in the context of 
mechanics. This is a pity, for two reasons. 

First, it is worth stressing that all these equivalences and related other results, 
are purely mathematical and so entirely general. Second, the equivalences and results 
that get omitted from most mechanics textbooks are at least as rich as those included; 
in particular, in their use of geometric ideas. I might add: 'in their use of optical 
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ideas'. Indeed, Hamilton developed his work in mechanics in deliberate analogy with 
his previous work in optics. 2 And as we shall see: both Fermat's principle (roughly, 
that a light ray travels the path that takes least time) and Huygens's principle (roughly, 
that given a wave- front, a later wave- front is the envelope of spherical waves spreading 
from the points of the given wave-front) stand at the centre of Hamilton- Jacobi theory. 
They involve each of the above mathematical problems, in optical guise: viz. the 
description of light in terms of rays (exemplifying (ODE)), in terms of wavefronts (cf. 
(PDE)), and by means of variational principles (cf. (CV)). 

Accordingly, I propose to expound some of these equivalences and connections, as 
mathematics (Sections 121 to EJ). Then I will illustrate them with geometric optics and 
the optico- mechanical analogy (Sections and |HJ). 

To be both brief and elementary, this exposition must be very selective. In particu- 
lar, I will say nothing about: (i) weak solutions; (ii) the use of phase space; (iii) issues 
about the global existence of solutions, including focussing and caustics. 3 Another 
omitted topic lies closer to our concerns: I will not present the theory surrounding 
Jacobi's theorem, i.e. Hamilton- Jacobi theory as an integration theory for first order 
partial differential equations. For though I have complained that this is absent from 
the mechanics books, it is in some books on mathematical methods. 4 

Instead, I will adopt an approach that emphasises the calculus of variations. The 
main ideas here seem to be due to Caratheodory and Hilbert. Here again, I must be 
selective: I will simply pick out within this approach, one line of thought, found for 
example in the first half of Rund (1966). (Rund proves some results which I will only 
state; and he cites the original papers.) Though selective, this exposition will give a 
good sense of the triangle of equivalences between (ODE), (PDE) and (CV); indeed, 
we will get such a sense already by the end of Section El Sections |U to HI will add to this 
a discussion of three topics, each leading to the next. They are, respectively: Hilbert's 
independent integral; treating the integration variable of the variational problem on 
the same footing as the other coordinates; and integration theory. 

Thereafter, Sections [7| et seq. return us to physics. Section [7| discusses geometric 

2 For a glimpse of the history, which I will not discuss, cf. e.g.: for mechanics, Dugas (1988), 
Whittaker (1959); for optics, Whittaker (1952), Buchwald (1989); and for mathematics: Kline (1970, 
Chap. 30). 

3 A few pedagogic references: for (i) Logan (1994, Chap. 3), Stakgold (1967); for (ii), Arnold (1989, 
Chap.s 8, 9), Littlejohn (1992), Taylor (1996, Section 1.15); for (iii), Arnold (1989, Appendices 11, 
16), Benton (1977), Taylor (1996, Section 6.7). Of these topics, (ii) and (iii) are closest to this paper's 
interests in geometry, and in the transition between classical and quantum mechanics. For (ii), i.e. 
Hamilton- Jacobi theory in phase space, beautifully illustrates symplcctic geometry; and (ii) and (iii) 
are crucial in both quantization theory and semiclassical mechanics. 

Especially Courant and Hilbert (1962, Chap. II. 1-8); cf. also e.g. Webster (1950, Chap 2) 
and John (1971, Chap 1). In order to be elementary, I will also avoid all use of modern differential 
geometry, including even the distinction between contravariant and covariant indices. Though modern 
geometry has transformed our understanding of differential equations and the calculus of variations 
(and the sciences of mechanics and optics), I shall only need the intuitive geometry familiar from 
multivariable calculus. 
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optics; and Section |H1 the optico-mechanical analogy and wave mechanics. Section |H] 
also leads us back to the foundations of quantum mechanics: which I take up briefly in 
(the last) Section El Here I will call attention to the role of Hamilton- Jacobi theory in 
the pilot-wave theory of deBroglie and Bohm; and more specifically advertise Holland's 
recent work (2001, 2001a), which provides a Hamiltonian formulation of the pilot-wave 
theory. 

2 From the calculus of variations to the Hamilton- 
Jacobi equation 

2.1 The calculus of variations reviewed 

We begin by briefly reviewing the simplest problem of the calculus of variations; with 
which we will be concerned throughout the paper. This is the variational problem (in 
a notation suggestive of mechanics) 

6I:=6I[qi] = 5 f h L{ qi ,qi,t)dt = , i = l,...,n (2.1) 

Jt 

where [ ] indicates that J is a functional, the dot denotes differentiation with respect 
to t, and L is to be a C 2 (twice continuously different iable) function in all 2n + 1 
arguments. L is the Lagrangian or fundamental function; and / L dt is the fundamental 
integral. We will discuss this only locally; i.e. we will consider a fixed simply connected 
region G of a (n + l)-dimensional real space R n+ , on which there are coordinates 
(q 1 ,...,q n ,t) =: (q i: t) =: (q,t). 

The singling out of a coordinate t (called the parameter of the problem), to give 
a parametric representation of curves q(t) := qi(t), is partly a matter of notational 
clarity. But it is of course suggestive of the application to mechanics, where t is 
time, q represents the system's configuration and (gj,t)-space is often called 'extended 
configuration space' or 'event space'. Besides, the singling out of t reflects the fact 
that though it is usual to assume that L (and so the fundamental integral) is invariant 
under arbitrary transformations (with non- vanishing Jacobian) of the qi, we do not 
require the fundamental integral to be independent of the choice of t. Indeed we shall 
see (at the end of this Subsection and in Section [SJ) that allowing this dependence is 
necessary for making Legendre transformations. 5 

A necessary condition for / to be stationary at the C 2 curve q(t) := <&(t) — i.e. 
for 51 = in comparison with other C 2 curves that (i) share with q(t) the end- 
points q(to),q(ti) and (ii) are close to q(t) in both value and derivative throughout 

5 Of course, the calculus of variations, and Hamilton-Jacobi theory, can be developed on the as- 
sumption that the fundamental integral is to be parameter-independent — if it could not be, so much 
the worse for relativistic theories! But the details, in particular of how to set up a canonical formalism, 
are different from what follows. For these details, cf. e.g. Rund (1966, Chapter 3). 



t < t < t\ — is that: q(t) satisfies for t < t < t\ the n second-order Euler-Lagrange 
(also known as: Euler, or as Lagrange!) equations 

^L qi -L qi = i = l,...,n. (2.2) 

A curve satisfying these equations is called an extremal. 

We will not need to linger on the usual derivation of these equations: we will later 
see them derived without using a single fixed pair of end-points. Nor need we linger on 
several related matters taken up in the calculus of variations, such as: the distinction 
between stationarity and extrema (i.e. maxima or minima), in particular the conditions 
for a curve to be an extremum not just a stationary point (e.g. conditions concerning 
the second variation of the fundamental integral, or Weierstrass' excess function); the 
distinction between weak and strong stationary points and extrema; and the use of 
weaker assumptions about the smoothness of the solution and comparison curves. 

But it is important to consider the canonical form of our variational problem. In 
physics, the most frequent example of this is the expression of Hamilton's principle 
within Hamiltonian mechanics; i.e. Hamilton's principle with the integrand a function 
of both qs and ps, which are to be varied independently. But the correspondence 
between the Lagrangian form of the variational problem (above) and the canonical 
form is general (purely mathematical). 

Thus, under certain conditions the variational problem eq. 12.11 has an equivalent 
form, whose Euler-Lagrange equations are 2n first order equations. To this end, we 
introduce "momenta" 

Pi ■= Lq t ; (2.3) 

and (recalling that L is C 2 ) we assume that the Hessian with respect to the qs does 
not vanish in the domain G considered, i.e. the determinant 

I ¥ o ; 

so that eq. 12.31 can be solved for the <ji as functions of qi,Pi,t 

Then the equations 

Pi = L qi qi = H Pz L(q h q u t) + H(q h p u t) = E^pi (2.5) 

represent a Legendre transformation and its inverse; where in the third equation qi are 
understood as functions of qi,Pi,t according to the inversion of eq. 12.31 The function 
H(qi,pi, t) is called the Legendre (or. Hamiltonian) function of the variational problem, 
and the qs and ps are called canonically conjugate. It follows that H is C 2 in all its 
arguments, H t = —L t , and | L qiqj \ = \ H PiP , Besides, any H(qi,Pi,t) that is C 2 in 
all its arguments, and has a non- vanishing Hessian with respect to the ps, \ H PiPj |^ 0, 
is the Legendre function of a C 2 Lagrangian L given in terms of H by eq. 12.51 

Applying this Legendre transformation, the Euler-Lagrange equations eq. 12.21 go 
over to the canonical system 

q z = H Pi p t = -H qi (=L q J . (2.6) 
fi 



(2.4) 



(A curve satisfying these equations is also called an extremal.) These are the Euler- 
Lagrange equations of a variational problem equivalent to the original one, in which 
both qs and ps are varied independently, namely the problem 

5 J (E iqiPi -H(q h p h t)) dt = . (2.7) 

(For more details about eq. O to O cf. e.g. Arnold (1989, Chap. 3.14, 9.45.C), 
Courant and Hilbert (1953, Chap. IV. 9. 3; 1962, Chap. 1.6) and Lanczos (1986, Chap. 
VI. 1-4).) 

The requirement of a non- vanishing Hessian, eq. 12.41 fequivalentlv: | H PiPj |^ 0), is 
a crucial assumption. Note in particular these two consequences. 

1) The Hamiltonian cannot vanish identically. Proof: If we differentiate H = 
^QiPi — L = with respect to we get Sj L qiq ,qi = 0; which contradicts eq. 12.41 

2) L cannot be homogeneous of the first degree in the That is, we cannot have: 
L(qi, \(ji,t) = XL(qi,qi,t). We shall see in Section El that this means the fundamental 
integral cannot be parameter-independent. 



2.2 Hyper surf aces and congruences 

We consider a family of hypersurfaces in our region G of IR n+1 

S( qi ,t)=a (2.8) 

with a G IR the parameter labelling the family, and S a C 2 function (in all n + 1 
arguments). We assume this family covers the region G simply, in the sense that 
through each point of G there passes a unique hypersurface in the family. 

Let C be a curve 

qi = qM (2.9) 

of class C 2 , that lies in G and intersects each hypersurface in the family eq. 12.81 just 
once, but is nowhere tangent to a hypersurface. Then a is a function of t along C, with 

da ^ dS . dS 

By construction A ^ 0. We will assume that the Lagrangian L does not vanish along 
C. By a suitable labelling of the family of surfaces, we can secure 

A > or < according as L > or < (2.11) 

for the line-element (qi, qi, t) of C. Then a tangential displacement along C from P := 
(qi, t) to Q := (qi + dqi, t + dt), i.e. a displacement with components (dqi, dt) = (qi, l)dt, 
induces an increment da in a, and an increment dl = L(qi, qi, t)dt in / = J L dt. 
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To connect this family of hypersurfaces with the calculus of variations, we now seek 
values of at P such that the direction at P of the curve C, (<ji, l)dt, makes dl /da a 
minimum with da fixed. A necessary condition is that 



7T- T- =°> i = l,...,n. (2.12) 




But £ = i and A ^ °' so that ec l- EH21 reads 

dL L dA 

dqi A dtji 

that is, using §| = g from eq. I27T7H 



= ; (2.13) 



dL _ LdS 
dqi A % 



(2.14) 



A curve C, or its tangent vector (<ji,l), that satisfies eq. 12.141 is said to be in the 
direction of the geodesic gradient determined by the family of surfaces \'2.Sl 

As it stands, this condition eq. 12.141 can at best yield minima of dl/da; while we 
are interested in minima of dl/dt. But there is a further condition on the family of 
surfaces eq. I2.8l that implies that curves obeying eq. 12. 141 are solutions of the variational 
problem; or rather, to be precise, extremals. 

This condition has two equivalent forms; the first geometric in spirit, the second 
analytic. They are: 

(a) : that the quantity L/A is constant on each surface, i.e. there is some real 
function such that 

^ = <K<t) (2.15) 

where we are to take the directional arguments in L to refer to the geodesic gradient. 

(b) : that S solves the Hamilton- Jacobi equation. 

It is straightforward to show that (a) implies that we can re-parametrize the family 
of surfaces in such a way that L = A throughout the region G. That is to say: given 
(a), the family can be re-parametrized so that function <fr is the constant function 1: 
4>{a) = 1. (Proof: any monotonic function ip gives a re-parametrization of the family, 
ip(S) = ip(a), with A defined on analogy with A by A := ^V'(cr) = ip'(a)A. Choosing 



-0(c) := fv 4>(s) ds (o"o some constant) yields ip'(a) = 4>(a) so that = ^,^ A 



So to show (a) and (b) equivalent, we will show that: 

(i) given (a) in this special form, i.e. given L = A, S solves the Hamilton- Jacobi 
equation; and conversely 

(ii) S solving the Hamilton- Jacobi equation implies that L = A. 

But it will be clearest, before proving this equivalence, to present two consequences of 
L = A, and introduce some terminology. 
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First: L = A implies that the geodesic gradient, eq. 12.141 is now given by 

dL _ _dS 

where the first equation uses eq. 12.31 Recall now our assumption that the determinant 
I -k<ji<?j M 0; so that eq. 12.31 can be solved in G for the as functions of qi,pi,t: 
Qi — Qi(Qi,Pi,t). This now reads as 

dS 

?< = ?»(?<,«-,*), (2-17) 

where the right-hand side is a function of alone (since S 1 is) and has continuous 

first order derivatives. Then the elementary existence theorem for solutions of first 
order ordinary differential equations implies that eq. 12.171 defines an n-parameter 
family of curves in the region G, such that each point in G has a unique curve pass 
through it, and each curve is a solution of eq. 12.171 in the sense that the components 
of its tangent vectors obey eq. 12.171 This family of curves is called the congruence K 
belonging to the family of surfaces eq. 12.81 

Second: L = A implies that the increment dl in the fundamental integral / = 
/ L dt, in passing from a point Pi on the surface S(qi,t) = <Ti, to an adjacent surface 
S = a i + da, along a curve of the congruence belonging to the family, obeys 

dI = Adt = da. (2.18) 

Integrating this result along members of the congruence, we get: the integral along a 
curve of the congruence, from any point Pi on the surface S(qi,t) = o~\ to that point 
P 2 on the surface S(qi,t) = a 2 that lies on the same curve of the congruence, is the 
same for whatever point P 1 we choose. That is: 

/ Ldt = a 2 -a 1 . (2.19) 
JPi 

Clearly, the converse also holds: if the fundamental integral taken along curves of the 
congruence has the same value for two hypersurfaces, however we choose the end-points 
Pi,P 2 lying in the hypersurfaces, then L = A. So a family of surfaces satisfying the 
condition that L = A is called geodesically equidistant with respect to the Lagrangian 
L. (Courant and Hilbert (1962, Chap. II. 9. 2) say 'geodetic', not 'geodesic'; which has 
the advantage of avoiding 'geodesic" s possibly confusing connotations of metric and-or 
connection.) 

Caratheodory called a family of geodesically equidistant hypersurfaces, together 
with the congruence belonging to it, the complete figure (of the variational problem). 
As we shall see, the name is apt, since the complete figure is central to Hamilton- Jacobi 
theory. Also, the congruence is called transversal to the surfaces of the family. The 
analytical expression of transversality is that for a displacement St) tangential to 
a hypersurface in the family, 5S = 0. That is: 

* + = (2.20) 
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We turn to showing that: (i) L — A implies that S solves the Hamilton- Jacobi 
equation. 

Proof: Eq. ICTH yields 

N A da ^ dS dS . 
L(q l ,q l ,t) = A:=- = ^-q l + - (2.21) 

where tji refers to the direction of the geodesic gradient, eq. 12.171 i- e - Qi — QiiVh fr>A)- 
This yields 

OS . . dS . > dS , dS . , 

= -Lfe,^, —,*),*) + S< — ft ( ft> — ,*). (2.22) 

But eq. 12.51 implies that the right-hand side is the Hamiltonian function, but with Pi 
replaced by |^ in accordance with eq. 12.161 Thus we have 

f + *(*,|[, «)=<>; (2.23) 

which is the Hamilton- Jacobi equation. 

This equation is also a sufficient condition of a family of surfaces being geodesically 
equidistant. That is, (ii): S being a C 2 solution in G of the Hamilton- Jacobi equation 
implies that L — A, i.e. that the hypersurfaces of constant S are geodesically equidis- 
tant. 

Proof: Given such a solution S(qi,t), let us define an assignment to each point of 
G (sometimes called a field) by 

dS 

Pi = Pi (q h t):= — . (2.24) 

By eq. 12. 4[ this determines a field as in eq. 12.171 and hence a congruence. Then for 
the given solution S, a given member C of the congruence, and two given parameter 
values cri,(T2, we form the fundamental integral along C between the points Pi and 
P 2 where C intersects the hypersurfaces corresponding to the parameter values 01,02 . 
Using the Legendre transformation, eq. 12.51 and the fact that S solves the Hamilton- 
Jacobi equation, eq. 12.231 we obtain: 



I L dt = - H(q h — ,t) - T.iPiqi dt = / —dt + E^— dq { = dS = a 2 -cri 

JPl J Pi OQi J Pi Ot OQi JPi 



OS 



dqi 



Pi 



dS , „ OS 



rP 2 
'Pi 



(2.25) 

To sum up: a family of hypersurfaces S — a is geodesically equidistant with re- 
spect to the Lagrangian L iff S is a solution of the Hamilton- Jacobi equation whose 
Hamiltonian H corresponds by the Legendre transformation to L. And if this holds, 
the transversality condition, eq. I2.2U1 can be written (using eq. 12.231 and 12.24)) as 

p i 5q i -H{q l ,p l ,t)5t = 6 (2.26) 



6 Transversality can also be defined, without any use of a family of hypersurfaces, or even a function 
S, in terms of the fundamental integral being stationary as an end-point of the integral varies on a 
given single surface. Cf. e.g. Courant and Hilbert (1953, Chap. IV. 5. 2). 
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3 Canonical and Euler- Lagrange equations; fields 
of extremals 



We now study the properties of a congruence K belonging to a family of geodesi- 
cally equidistant surfaces. We first show that any curve of such a congruence obeys 
the canonical and Euler-Lagrange equations. Then we develop the ideas of: a field 
qi,Pi in the region G; and a field belonging to a family of (not necessarily geodesically 
equidistant) hypersurfaces. Finally we characterize those fields belonging to geodesi- 
cally equidistant hypersurfaces. 



3.1 Canonical and Euler-Lagrange equations 

The family eq. 12.81 defines an assignment of Pi := to each point of a member C of 
the congruence K. If we differentiate the definition of p i.e. eq. 12.241 with respect to t 
along C, and we differentiate the Hamilton- Jacobi equation eq. I2.23( and we then use 
the fact (from eq. 12.6)) that (ji = we can eliminate the second derivatives of 5* that 
arise in the differentiations, and get: 

OH , . 

dqi 

To this, we can adjoin qi = so as to get 2n first order ordinary differential equations 



obeyed by members of K 



OH . dH 
Pi = ; 9i = ^— ■ (3.2) 

dqi dpi 



Note that according to this deduction, these two groups of equations have different 
statuses, despite their symmetric appearance. Pi = —jt~ depends on K belonging to a 
family of geodesically equidistant surfaces (i.e. on the Hamilton- Jacobi equation ). But 
<ji — ^7 are identities derived from the theory of the Legendre transformation (cf. eq. 
12. fjjl . But this difference is not peculiar to our deduction's use of hypersurfaces. The 
same difference occurs in derivations of these equations in the calculus of variations 
with fixed end-points: in the most familiar case, in Lagrangian mechanics i.e. without 
use of the canonical integral; (cf. e.g. Lanczos (1986, p. 166-7). 

From the canonical equations we can deduce the (Lagrangian form of the) Euler- 
Lagrange equations. We substitute p { = in the left-hand side, and = — |^ in 
the right-hand side, of the first of eq. 13.21 to get 
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3.2 Fields 



To discuss fields, we need first to consider parametric representations of an arbitrary 
smooth congruence of curves covering our region G simply. That is, we consider a 
congruence represented by n equations giving q^ as C 2 functions of n parameters and t 

qi = qi(u a ,t) i = l,...,n (3.4) 

where each set of n u a = (u\, . . . ,u n ) labels a unique curve in the congruence. Thus 
there is a one-to-one correspondence (qi,t) <-> (u a ,t) in appropriate domains of the 
variables, with non-vanishing Jacobian 

Such a congruence determines tangent vectors (<&,!.) at each (qi,t); and thereby 
also values of the Lagrangian L(qi(u a ,t),qi(u a ,t),t) and of the momentum 

dL 

Pi = Pi{u a ,t) = — ■ (3.6) 

Conversely, a set of 2n C 2 functions qt,Pi of (u a , t) as in eqs 13.41 and !3.61 with the qs and 
ps related by pi = |4, determines a set of tangent vectors, and so a congruence. Such a 
set of 2n functions is called a field; and if all the curves of the congruence are extremals 
(i.e. solutions of the Euler-Lagrange equations), it is called a field of extremals. 

We say a field belongs to a (not necessarily geodesically equidistant) family of hy- 
persurfaces given by eq. 12.81 iff throughout the region G eq.s 12.161 and 13.41 are together 
satisfied, i.e. iff we have 

d d 

Pi = -z-S(qi,t) = —S(qi(u a ,t),t) . (3.7) 
c% dqi 

One can show that a field belongs to a family of hypersurfaces iff for all indices a, (3 — 
1, . . . , n, the Lagrange brackets of the parameters of the field, i.e. 

\ u U |. = S ( ^ d Pi d Qi d Pi \ fggN 

l \du a du/3 dupdu a ) 

vanish identically. 7 

We say that a field = qi(u a ,t),pi = pi(u a ,t) is canonical if the qi,Pi satisfy eq. 
13.21 Now we will show that if a canonical field belongs to a family of hypersurfaces eq. 
12.81 then the members of the family are geodesically equidistant. 

7 Cf. Rund (1966, p. 28-30). Warning: the role of Lagrange brackets in this theory is sometimes 
omitted even in excellent accounts, e.g. Courant and Hilbcrt (1962, Chap. II. 9. 4). 
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Proof: Differentiating eq. 13.71 with respect to t along a member of the congruence, 
and substituting on the left-hand side from the first of eq. 13.21 we get 

_d_H_ 8 2 S , d 2 S 

dqi dqidq^ 3 dqidt 

By the second of eq. 13.21 this is 

92S + o (3.10) 



dqidt dqi dpj dqidq 

which is 

dS 




+ H{q ^dq~ ,t) ) =0 (3 ' 11} 



which is integrated immediately to give 

dS TTI dS 

m +H{q -d- qj 



+ H(q 3 ,—,t) = f(t) (3.12) 



with / an arbitrary function of t only. Now we argue (in the usual way, for the 
calculus of variations) that this function can be absorbed in H. For suppose the given 
Lagrangian were replaced by L = L + /(£). The path-independence of the integral 
J f(t) dt implies that L and L give equivalent variational problems, i.e. the same 
curves give stationary values for both J L dt and J L dt. Besides, the definition of pi, 
eq. 12. 3[ and the canonical equations eq. 12.61 are unaffected, the only change in our 
formalism being that H is replaced by H — H — f(t). So assuming that L is replaced 
by L means that eq. 13.121 reduces to the Hamilton- Jacobi equation, eq. 12.231 The 
result now follows from result (ii) at the end of Section 12.21 

This result is a kind of converse of our deduction of eq. 13.21 We can sum up this 
situation by saying that the canonical equations characterize any field belonging to a 
family of geodesically equidistant hypersurfaces. 

Finally, we should note an alternative to our order of exposition. We assumed 
at the outset a family of hypersurfaces, and then discussed an associated congruence 
and field. But one can instead begin with a single arbitrary surface; then define the 
notion of an extremal being transverse to the surface (in terms of the fundamental 
integral being stationary as an end-point varies on the surface — cf. footnote 6); then 
define a field of such transverse extremals; and finally define other surfaces, geodesically 
equidistant to the given one, as surfaces S = constant, where S(qi,t) is defined to be 
the value of the fundamental integral taken along a transverse extremal from the given 
surface (S = 0) to the point (qi,t). This alternative order of exposition is adopted 
by Courant and Hilbert (1962, Chap. II. 9. 2-5), and (more briefly) by Born and Wolf 
(1999, Appendix 1.2-4). It has the mild advantage over ours of clearly displaying the 
choice of an arbitrary initial surface; (which accords with the solution of a partial 
differential equation involving an arbitrary function just as the solution of an ordinary 
differential equation involves an arbitrary constant or constants). It will also come up 
again in Sections El and [7| 
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4 Hilbert's independent integral 



A canonical field belonging to a geodesically equidistant family of hypersurfaces defines 
a line-integral which is independent of its path of integration. This integral, named 
after its discoverer Hilbert, is important not only in Hamilton- Jacobi theory, but also 
in aspects of the calculus of variations which we do not discuss, e.g. the study of 
conditions for the fundamental integral to take extreme values. 

Suppose we are given a geodesically equidistant family of hypersurfaces covering 
region G simply. Consider two arbitrary points Pi,P 2 G G lying on hypersurfaces 
S = o"! , S = a 2 respectively; and consider an arbitrary C l curve C : qi = qiit) lying in 
G and joining Pi and P 2 . We will write the components of the tangent vector (dqi/dt, 1) 
of C as (q'^ 1); for we continue to use the dot ' for differentiation along the geodesic 
gradient of the field belonging to S. Now consider the integral along C of dS, so that 
the integral is trivially path-independent: 

J := / dS(q i7 t) = a 2 -a 1 = —q[ + — ] dt . (4.1) 



Pi " J Pi \dqi 1 dt J 

We can apply Pi = and the Hamilton- Jacobi equation to the first and second terms 
of the integrand respectively, to get a path-independent integral 

rP2 

J= (PiQi- H(qi, Pi ,t)) dt = a 2 -a x . (4.2) 
JPi 

We can also Legendre transform to eliminate the pi in favour of <ji, getting 

J = j p iHqi, qi, t) + gT- (qi - 4i) J dt = o 2 -o x . (4.3) 

It is in this form that J is usually called the Hilbert integral. 
A field 

qi = qi(u a , t) pi = pi(u a , t) (4.4) 

(assumed to belong to a family of hypersurfaces in the sense of eq. 13 .7|) is called 
a Mayer field if substituting q^Pi in the integral in eq. 14.21 yields an integral that 
is path- independent . So we have seen that a canonical field is a Mayer field. One 
can show that the converse holds, i.e. any Mayer field is canonical (Rund 1966, p. 
33). So we have the result: a Mayer field is a canonical field belonging to a family 
of geodesically equidistant hypersurfaces. (It can also be shown that every extremal 
curve can be imbedded in a Mayer field.) 

Combining this with the results of Section El we also have: the field eq. 14.41 is a 
Mayer field iff the Lagrange brackets [u a , vanish and the field obeys the canonical 
equations eq. 13.21 
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5 The parameter as an additional (/-coordinate 



As we said at the start of Section |2~T1 our theory has depended from the outset on the 
choice of £; (cf. the fundamental integral eq. I2.1J1 . Indeed, we saw at the end of Section 
12.11 that the non- vanishing Hessian eq. 12.41 implies that L cannot be homogeneous of 
the first degree in the g$; i.e. we cannot have for all A G 1R, L(qi, Xtji, t) = XL(qi,qi,t). 
And we shall shortly see that this implies that the fundamental integral cannot be 
parameter-independent. 

But for some aspects of the theory, especially the next Section's discussion of 
Hamilton- Jacobi theory as an integration theory for first order partial differential equa- 
tions, it is both possible and useful to treat t as a coordinate on a par with the gs. So 
in this Section, we describe such a treatment and the gain in symmetry it secures. 

To have some consistency with our previous notation, we first consider a Lagrangian 
L(q a ,q a ,t) with n — 1 coordinates q a , a parameter t and derivatives q a = dq a /dt. So 
note: in this Section, Greek indices run from 1 to n — 1. So the fundamental integral 
along a curve C : q a = q a (t) in a suitable region G of IR n joining points -Pi, A with 
parameters ti, t% is 

rt-2 

I = / L(q a ,q a ,t) dt . (5.1) 

Now we introduce a real C l function r(t) which is such that dr/dt > for all values 
of t under consideration, but is otherwise arbitrary. We write derivatives with respect 
to r using dashes, so that 

(5.2) 

So with n := r(ti), r 2 := r(i 2 ), we can write eq. 15.11 as 

r 2 ( , dr \ dt , . 

If we now write q n for t, so that we can write the coordinates on H n as 

q% = (<?«, t) = (q a , q n ) i = l,...,n and write ^- = q' n ^ , (5.4) 

ar 

then we can write eq. 15.31 as 

/= f 5 L*(q t ,q')dT (5.5) 

where we have defined 

L*( qi ,ql):=L%q a ,t,C<Q--=L(qa,^,tJ • (5.6) 

We stress that the values of the integrals eq.s 15.11 and 15.51 are equal. But the 
latter is by construction parameter-independent, since the choice of r is essentially 
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arbitrary. Also L* is by construction positively homogeneous of the first degree in the 
Qi = (QaiQn) — i- e - f° r a ^ positive numbers A, L*(q i: Xq'j) = XL(q i ,q' i ) — irrespective of 
the form of the given Lagrangian L. In fact one can easily show that these two features 
are equivalent. 

For the purposes of the next Section, we will now express the canonical equations 
of our variational problem, eq.s l5.ll or 15.51 in the new notation. But note: the total dif- 
ferentiation on the left-hand sides of the canonical equations will still be differentiation 
with respect to the original parameter t — and so indicated by a dot. 

Writing the conjugate momenta of L* as p* for the moment, we have 

P * = 7TT = IT- — T<7n = Pa (5.7) 
dq' a dq a q' n 

so that these are identical with the original conjugate momenta; and so we will drop 
the * in p* a . So the canonical equations for the indices 1, . . . , n — 1 are given, with the 
original Hamiltonian (Legendre) function H(q a ,p a ,t) as defined in eq. 12. 5( by 

q a = t— , p a = • (5.8) 

dp a dq a 

On the other hand, for the new p n , we have 

dL* dL q' 

Pn ■= -K-y = L - S a — — f = L - S a p a q a . (5.9) 
Oq n Oq a q n 

Comparing with the definition eq. 12.51 of the Hamiltonian (Legendre) function, this is 

p n + H(q QjPa ,t) = 0. (5.10) 

So differentiating p n with respect to the original parameter t along an extremal gives 

_ dp _ dH dH dH 
^ n dt dt dt dq n 

which fits well with eq. 15.81 (here — = — ^ follows as usual from the canonical 
equations, i.e. from i7's Poisson bracket with itself vanishing identically). But note 
that we also have q n := ^ = 1 ^ J^- = -1. 

However, we can use the Hamilton- Jacobi equation to overcome this last "wrinkle" . 
i.e. to get a greater degree of symmetry. We can write the Hamilton- Jacobi equation 
of our variational problem eq. 15.11 as 

^( dS\ / dS \ dS n , c1n . 

where $ is defined function of 2n variables by 

$(q h Pi) := H(q a ,p a ,q n ) +p n . (5.13) 



Now if the p a in eq. 15.131 refer to a field of extremals belonging to a solution S(q a , q n ) 
of the Hamilton- Jacobi equation, so that p a = then by eq.s 15.11)1 and I5.12"! we also 



have: p n = j^j-. Besides, eq. 15. 131 implies immediately 

d$__dH_ d$ _ d# ^5. _ ! (_ ^ = ^) ( 5 14 ) 

% dqi ' dp a dp a ' dp n n dt 

It follows that we can write the canonical equations eq. 15. 8[ together with the relations 
for q n ,p n , in a completely symmetrical way in terms of $ as 

a--** v-- 9 - ■ (515) 

where, note again, the dot denotes differentiation with respect to t. 




6 Integrating first order partial differential equa- 
tions 

As mentioned in Section ^ we will not expound the usual approach (with Jacobi's 
theorem) to Hamilton- Jacobi theory as an integration theory for first order partial dif- 
ferential equations. 8 Instead, we will in this Section briefly introduce another approach 
which exploits the results and concepts of the previous Sections; (for more details, cf. 
Rund, 1966, Chap. 2.8). 

We will consider a partial differential equation of the form 

= , i — 1, . . . , n ; with — — ^ for at least one % , (6.1) 

OPi 

and $ of class C 2 in all 2n arguments. One of the i for which ^ may be identified 
with t, but this is not necessary: as in the previous Section, our discussion can treat 
all coordinates of IR n on an equal footing. We shall also assume that (as suggested 
by the Hamilton- Jacobi equation) the unknown function 5* does not occur explicitly in 
the equation; but this is not really a restriction, since one can show that the general 
case, i.e. an equation in which S occurs, can be reduced to the form of eq. 16.11 by 
introducing an additional independent variable. 

So the initial value problem we are to solve is: to find a function S(qi) (g, G G) that 
satisfies eq. 16. II and that assumes prescribed values on a given (n — l)-dimensional C 2 
surface, V say, in G. We will indicate how to explicitly construct such a function by 
using a congruence of "canonical" curves which solve a canonical system of ordinary 
differential equations; (so we reduce the integration of the partial differential equation 

8 For an exposition cf. the references in footnote 4. As to the history: Whittaker (1959, p. 264,316) 
reports that this theory was developed by Pfaff and Cauchy in 1815-1819, using earlier results by 
Lagrange and Monge; i.e. well before Hamilton's and Jacobi's work! 
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to the problem of integrating ordinary differential equations). This canonical system 
of equations will be suggested by our previous discussion; and the strategy of the 
construction will be to adjust the congruence of curves from an initial rather arbitrary 
congruence, to one that provides a solution to eq. 16. II 9 

Thus our previous discussion (especially Sections El and EJ) suggests we should con- 
sider the system of 2n ordinary differential equations, with a new parameter s 

Qi-= ~r = — o > Pi '= ~r = o • (6-2) 

as opi as oqi 

These are called the characteristic equations of eq. 16.11 A curve qi = qi(s) of IR n that 
satisfies them is called a characteristic curve of eq. 16.11 it will be an extremal of a 
problem in the calculus of variations if eq. 16. II is the Hamilton- Jacobi equation of such 
a problem. Our approach to integrating eq. 16.11 applies to these characteristic equa- 
tions theorems about the existence and uniqueness of solutions of ordinary differential 
equations, so as to secure the existence and uniqueness of solutions to eq. 16.11 

Let us consider an (n — l)-parameter congruence of characteristic curves, with 
parameters Ui, . . . , u n -i, so that we write 

qi = qi(s,u a ) , Pi = Pi(s,u a ) . (6.3) 

Since $ is C 2 , it follows from eq. 16.21 that the functions 16.31 are C 2 in s. We will 
also assume that these functions are C 2 in the u a ; and that this congruence covers the 
region G simply, with 

"'*•) ^0, (6.4) 



d(s,m, . . .,u n -i 
so that we can invert the first set of eq. 16.31 for (s,u a ), getting 

s = s(qi) , u a = u a (qi) . (6.5) 

We shall also write (in G): 

<j)(s,u a ) := $(qi(s,U{3),Pi(s,Up)) . (6.6) 

One can now show: 

(i) : (j) of eq. 16.61 is an integral of the characteristic equations eq. 16.21 i.e. ^ = 0; 

(ii) : the Lagrange brackets [u a ,s] and [u a , Up) are constant along any member of 
the congruence defined by eq. 16.21 

We now make some assumptions about the relation of our characteristic congruence 
to the given surface V. We will assume that through each point of V there passes a 
unique member of the congruence, and that the congruence is nowhere tangent to V. 



9 We remark at the outset that since — as in previous Sections — we work in a "configuration space" , 
not its twice-dimensional "phase space" , there are many "canonical congruences" , rather than a unique 
one; so that this sort of adjustment is possible. 
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Thus each point in V is assigned n — 1 parameter values u a and a value of s; so we can 
write s on V as a C 2 function of w a , the parameters of the unique curve through the 



point. Let us write this as s 



<7[Ur 



so that the functions aj(u a ) defined by 



ai(u a ) := qi(a(u a ),u a ) 



(6.7) 



are also C 2 . Finally we will suppose that we seek a solution of eq. 16. II which takes the 
values c{u a ) on V, c prescribed C 2 functions. 

That completes the assumptions needed for the construction of a (local) solution 
of eq. 16.11 (and the proof of its uniqueness). We end this Section by briefly describing 
the first steps of the construction. 

The theory of first order ordinary differential equations implies that the congruence 
of characteristic curves for eq. 16.21 is determined if the values of qi and pi are prescribed 
on V. The initial values of qi are of course to be given by the cij of eq. 16.71 But as to 
the initial values of the Pi, i.e. hi defined by 



h{u a ) := Pi(cr(u a ),u a ) 



(6i 



we have some choice. The strategy of the construction is, roughly speaking, to define a 
function S on G, in such a way that when we adjust the hi so that pi = S becomes 
a solution of eq. 16.11 in G, possessing the required properties. 

We now define a function z = z(s, u a ) on G in terms of V, the values c{u a ) pre- 
scribed on V and the given congruence; in effect, this z will be the desired 5*, once the 
hi are suitably adjusted. For each point P G G, with its n parameter values (s, u a ), the 
s- value of the intersection with V of the unique curve through P is given by s = a(u a ). 
We define the value of z at P by 



z(s,u a ) := c(u a ) + I 



Pi(r, U a ) 



d$(qj(T, U a ),Pi(T, U a j) 
dpi 



dr 



(6.9) 



where the integration is to be taken along the curve through P, from its point of 
intersection with V, to P. 

We will not go further into the construction of the desired S, except to make two 
remarks. (1): Note that eq. 16.91 implies in particular that z(a(u a ),u a ) = c(u a ). 

(2): Differentiating eq. 16. 91 with respect to s and using the first set of eq. 16 .21 yields 



dz ^ 9$ 

OS OPi 



Si Piiji 



(6.10) 



This is analogous to the relation S = T^iPiqi between a scalar function, such as a 
solution S of the Hamilton- Jacobi equation and the field q^Pi belonging to it, i.e. the 
field such that pi = cf. eq. 13.71 Indeed, if we use eq. 16.51 to define a function S on 
G by 

S(qi) := z(s(qi),u a (qi)) (6.11) 
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then one can show (again, we omit the details!) that: 

(i) we can adjust the 6, so as to make Pi = §jp hold; and 

(ii) that this adjustment makes S, as defined by eq. 16.111 fand so 16. 9|) . a solution of 
eq. 16. II with the required properties. 



7 The characteristic function and geometric optics 

In this Section, we follow in Hamilton's (1833, 1834!) footsteps. We introduce the 
Hamilton- Jacobi equation via the characteristic function (as do most mechanics text- 
books); and then apply these ideas to geometric optics — so our discussion will (at 
last!) make contact with physics. The main point will be that the correspondence in 
our formalism between canonical extremals and geodesically equidistant hypersurfaces 
underpins the fact that both the corpuscular and wave conceptions of light can account 
for the phenomena, viz. reflection and refraction, described by geometric optics. 10 

We assume that our region G C lR n+1 is sufficiently small that between any two 
points Pi = (qn, t%), P 2 = (q 2i , t 2 ) there is a unique extremal curve C. To avoid double 
subscripts, we will in this Section sometimes suppress the i, writing P 1 = (q l7 P 2 = 
(q 2 ,t 2 ) etc. Then the value of the fundamental integral along C is a well-defined 
function of the coordinates of the end-points; which we call the characteristic function 
and write as 

S(qi,h; q 2 ,t 2 ) = / L dt = (^p^ - H) dt = ^pidqi - Hdt (7.1) 
Jti Jti J 

where the integral is understood as taken along the unique extremal C between the 
end-points, and we have used eq. 12.51 

Making arbitrary small displacements (6qi, 5ti), (6q 2 , St 2 ) at Pi, P 2 respectively, and 
using the fact that the integral is taken along an extremal, we get for the variation in 
S 

5S := S(qi + Sqi,h + 8tr, q 2 + 5q 2 ,t 2 + 5t 2 ) - S(qi,t 1 ; q 2 ,t 2 ) = 
OS dS dS OS 

aT^ 1 + a7"^ 2 + Si a — + Si a — = ~ H (ljiPji 0^] t ? • ( 7 - 2 ) 

oti °t 2 oqii oq 2 i 

Since the displacements are independent, we can identify each of the coefficients on the 
two sides of the last equation in eq. 17.21 getting 

wr = -[H(qi,Pi,t)} t =t 2 , ^- = [Pi]t=ta ( 7 - 3 ) 
ot 2 oq 2 i 

^-=[H{q h Pi,t)] t=tl , tt^- = -Nt=ti ( 7 - 4 ) 



10 This is an example of what philosophers call "under-determination of theory by data" . The 
escape from this sort of quandary is of course the consideration of other phenomena: in this case, the 
nineteenth-century study of diffraction and interference, which led to the rise of wave optics — cf. the 
start of Section |H1 

on 



in which the Pi refer to the extremal C at Pi and P2. 

These equations are remarkable, since they enable us, if we know the function 
S(qi,ti,q2,tz) to determine all the extremals (in mechanical terms: all the possible 
motions of the system) — without solving any differential equations! For suppose we are 
given the initial conditions (qi,Pi,tx), (i.e., in mechanical terms: the configuration and 
canonical momenta at time £1), and also the function S. The n equations ^ = —p\ 
in eq. 17.41 relate the n + 1 quantities (q 2 ,t 2 ) to the given constants qi,pi,ti. So in 
principle, we can solve these equations by a purely algebraic process, to get q 2 as a 
function of t 2 and the constants qi,pi,t\. Finally, we can get p 2 from the n equations 
p 2 = J-^ in eq. 17.31 So indeed the extremals are found without performing integrations, 
i.e. just by differentiation and elimination: a very remarkable technique. 11 

Substituting the second set of equations of eq. 17.31 in the first yields 

_+*(„,_,„) = „. (7.5) 

So the characteristic function S(qi,ti, q 2 ,t 2 ) considered as a function of the n + 1 
arguments (q 2 ,t 2 ) = (q 2 i,t 2 ) (i.e. with (gi,ti) fixed) satisfies the Hamilton- Jacobi 
equation. 

Assuming that this solution S is C 2 , it follows from result (ii) of Section POl that S 
defines a family of geodesically equidistant hyper surf aces, namely the geodesic hyper- 
spheres (for short: geodesic spheres) with centre Pi = (qi,ti). Thus the sphere with 
radius R is given by the equation 

S(q 1 ,t 1 ;q 2 ,t 2 ) = R (7.6) 

with (q±,ti) considered fixed. So every point P 2 on this sphere is connected to the 
fixed centre Pi = (<Zi,ii) by a unique extremal along which the fundamental integral 
has value R. These extremals cut the spheres eq. 17.61 transversallv. 

These geodesic spheres about the various points Pi are special families of hypersur- 
faces. For by taking envelopes of these spheres, we can build up successive members of 
an arbitrary family of geodesically equidistant hypersurfaces. This is the basic idea of 
Huygens ' principle in geometric optics. Though Huygens first stated this idea as part 
of his wave theory of light, it can be stated entirely generally. Indeed, there is a rich 
theory here. We will not enter details 12 , but just state the main idea. 

Thus consider some arbitrary solution S(qi,t) of the Hamilton- Jacobi equation 

dS TT/ dS , 



11 As Hamilton of course realized. He writes, in the impersonal style of the day, that 'Mr Hamilton's 
function S ... must not be confounded with that so beautifully conceived by Lagrange for the more 
simple and elegant expression of the known differential equations [i.e. L\. Lagrange's function states, 
Mr Hamilton's function would solve the problem. The one serves to form the differential equations of 
motion, the other would give their integrals' (1834, p. 514). 

12 For details, cf. Baker and Copson (1950), Herzberger (1958). In optics, the Hamilton-Jacobi 
equation is often called the eikonal equation. 
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and thereby the canonical field (congruence) K belonging to it, for which pi — Let 
hi,h 2 be two hypersurfaces corresponding to values <7i,<7 2 of S, i.e. G hj, (j = 

1,2) iff S(qi,t) = Oy Let P\ be in h\, and let the canonical extremal C through Pi 
intersect h 2 in P 2 . Then we already know from eq. 12. 191 that the fundamental integral 
along C is 

/ Ldt = a 2 -a l (7.8) 
JPi 

so that P 2 is in the geodesic sphere centred on Pi with radius o 2 — <j\. Huygens' 
principle states that more is true: h 2 is the envelope of the set of geodesic spheres of 
radius a 2 — o\ with centres on the hypersurface h\. 

As a final task for this Section, we briefly illustrate our formalism with another topic 
in geometric optics: namely, Fermat's "least time" principle, which states (roughly 
speaking) that a light ray between spatial points Pi and P 2 travels by the path that 
makes stationary the time taken. This illustration has two motivations. First: together 
with the next Section's discussion, it will bring out the optico-mechanical analogy — and 
so prompt the transition to wave mechanics. 

Second: it illustrates how our formalism allows t to be a coordinate like the qi, 
even though it is singled out as the integration variable; (cf. Section EJ). In fact, 
there are subtleties here. For if one expresses Fermat's principle using time as the 
integration variable, one is led to an integrand that is in general, e.g. for isotropic 
media, homogeneous of degree 1 in the velocities g,; and as noted in remark 2) at the 
end of Section 12.11 this conflicts with our requirement of a non- vanishing Hessian (eq. 
12 .4j) . i.e. with our construction of a canonical formalism. So illustrating our formalism 
with Fermat's principle in fact depends on using a spatial coordinate as integration 
variable (parameter along the light's path). As we will see in a moment, this gives an 
integrand which is in general, even for isotropic media, not homogeneous of degree 1 
in the velocities — so that we can apply the theory of Sections El onwards. 

So now our preferred coordinate t will be (not time, as it will be in mechanics) but 
one of just three spatial coordinates (q\,q 2 ,t) for ordinary Euclidean space. In fact, 
applications of geometric optics, e.g. to optical instruments which typically have an 
axis of symmetry, often suggest a natural choice of the coordinate t. 

At a point P = (qi,q 2 ,t) in an optical medium, a direction is given by direction 
ratios (qi, q 2 , t) = (qi, q 2 , 1). (So note: the subscripts 1 and 2 now refer to the first and 
second of three spatial axes "at a single time" — and not to initial and final configura- 
tions.) The speed of a ray of light through P in this direction will in general depend 
on both position and direction, i.e. on the five variables (qi,qi,t),i = 1,2; and so the 
speed is denoted by v(qi,qi,t). If c is the speed of light in vacuo, the refractive index 
is defined by 

n(qi,qi,t) := c/v(qi,qi,t) . (7.9) 

If n is independent of the directional arguments qi (respectively: positional arguments 
qi,t), the medium is called isotropic (respectively: homogeneous). 
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Now let the curve C : = qi(t) represent the path of a light-ray between two points 
Pi, P2 with parameter values t — t±, t — £2- Then the time taken to traverse this curve 
(the optical length of the curve) is 

T = * ds = * nfe^j + + idt= * L dt 

Jti v Jti c Jti 

where we have defined 

^,gi,t):=^^[(gi) 2 + (?2) 2 + i]^ • (r.ii) 

However, our discussion will not be concerned with this special form of L. We will 
only require that L be C 2 , and that the Hessian does not vanish, i.e. eq. 12.41 holds. 
One immediately verifies that this is so for isotropic media; (in fact the Hessian is 

^#[(gi) 2 + fe) 2 + i]-V0). 

We can now connect our discussion with the principles of Fermat and Huygens. We 
can again take Fermat's principle in the rough form above, viz. that a light ray between 
points Pi and P2 travels by the path that makes stationary the time taken. It follows 
that if light is instantaneously emitted from a point-source located at Pi = (qu,ti) 
(where now we revert to using '1' to indicate an initial location), then after a time 
T the light will register on a surface, F{T) say, such that each point P2 = (fe,^) 
on F(T) (where similarly, '2' indicates a final location) is joined to Pi by an extremal 
along which the fundamental integral assumes the common value T. This surface is the 
wave-front for time T, due to the point-source emission from Pi. Clearly, the family 
of wave-fronts, as T varies, is precisely the family of geodesic spheres (for L as in eq. 
l7~TTf around P x . 

Using the Hamilton- Jacobi equation eq. 17.71 (now with just three independent vari- 
ables qi,q2,t), we can readily generalize this, so as to describe the construction of suc- 
cessive wave-fronts, given an initial wave-front. Given an arbitrary solution S(q±, q2,t) 
of eq. 17.71 and an initial hypersurface hi given by S(qi,t) = a x , we can construct at 
each point Pi G hi the unique extremal of the canonical field belonging to the family of 
hypersurfaces of constant S. By Fermat's principle, each such extremal can represent 
a ray emitted from P x . If we define along each such extremal the point P2 such that 
that fundamental integral fpf L dt attains the value T, then the locus of these points 
P 2 is the surface S = o\ + T. Thus we construct a family of geodesically equidistant 
hypersurfaces. 13 To sum up: each solution of the Hamilton- Jacobi equation represents 
a family of wave-fronts, and the canonical field belonging to a family represents the 
corresponding light rays. 

13 The vector pi — is longer the more rapidly S increases over space, i.e. the more rapidly the 
light's time of flight increases over space. So Hamilton called pi the vector of normal slowness. 
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8 From the opt ico- mechanical analogy to wave me- 
chanics 



The rise of wave optics in the nineteeth century led to geometric optics being regarded 
as the short- wavelength regime of a wave theory of light. So its equations and principles, 
such as the Hamilton- Jacobi equation and Fermat's and Huygens' principles, came to 
be seen as effective statements derived in the short-wavelength limit of the full wave 
theory. But the details of these derivations are irrelevant here. 14 

For us the relevant point is that (as is often remarked: e.g. Synge (1954, Pref- 
ace), Rund (1966, p. 100)): once one considers this development, together with the 
optico- mechanical analogy as stated so far (i.e. as it stood for Hamilton), it is natural 
to speculate that there might be a wave mechanics, just as there is a wave optics. 
That is, it is natural to speculate that classical mechanics might describe the short- 
wavelength regime of a wave mechanics, just as geometric optics describes the short- 
wavelength regime of a wave optics. This is of course precisely what deBroglie, and 
then Schrodinger, did. To be more specific, using our Hamilton- Jacobi perspective: 
they proposed that S represented, not an ensemble of systems each fully described by 
its classical mechanical state (q,p), but a property of an individual system. 15 

In this Section, we give a simple sketch of this proposal. But we shall not give details 
of deBroglie's and Schrodinger's own arguments, which are subtle and complicated: 
(Dugas (1988, Part V, Chap. 4) gives some of this history). Our sketch is formal, 
though in the textbook tradition (Rund (1966, pp. 99-109) and Goldstein (1950, pp. 
307-314)); (various books give fuller accounts e.g. using the concepts of Fourier analysis 
and the group velocity of a wave-packet, e.g. Messiah (1966, pp. 50-64), Gasiorowicz 
(1974, pp. 27-32)). More precisely: we will first describe how when we apply Hamilton- 
Jacobi theory to a classical mechanical system, the S'-function defines for each time t 
surfaces of constant S in configuration space, so that by varying t we can calculate the 
velocity with which these "wave-fronts" propagate (in configuration space). So far, so 
classical. But then we will postulate that these wave-fronts are surfaces of constant 
phase of a time-dependent complex-valued wave-function on configuration space. This 
will lead us, with some heuristic steps, to the Schrodinger equation and so to wave 
mechanics. 

Let us consider a classical mechanical system with holonomic ideal constraints, 

14 cf. e.g. Born and Wolf (1999, Chap. 3.1, 8.3.1); Taylor (1996, Section 6.6-6.7) is a brief but 
advanced mathematical discussion. 

15 Of course, successful proposals often seem "natural" in hindsight; and some authors (e.g. Gold- 
stein (1950, p. 314)) maintain that deBroglie's and Schrodinger's proposal would have seemed merely 
idle speculation if it had been made independently of the introduction of Planck's constant and the 
subsequent struggles of the old quantum theory. Indeed, even in that context it was obviously: (i) 
daringly imaginative (witness the fact that the 5* wave propagates in multi-dimensional configuration 
space); and (ii) confusing (witness the interpretative struggles over the reality of the wave- function) . 
In any case, whether the proposal was natural or not — after all, 'natural' is a vague word — all can 
now agree that their achievement was enormous. 
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on which the constraints are solved so as to give a n-dimensional configuration space 
Q, on which the qi are independent variables. More technically, Q is a manifold, 
on which the qi are a coordinate system, and on which the kinetic energy defines a 
metric. But we shall not go into this aspect: we shall simply assume Q is equipped 
with the usual Euclidean metric on H n , and that the qi are rectangular coordinates. 
We further assume that any constraints are time-independent (scleronomous); i.e. any 
configuration in Q is possible for the system throughout the time period in question. 
The result of these assumptions is that the region G C ]R n+1 for which the formalism 
of Sections 121 has been developed is now assumed to be an 'event space' or 'extended 
configuration space' of the form Q x T, where T C 1R is some real interval representing 
a period of time. Finally, we will assume that our system is conservative, with energy 
E. 

Now we will presume, without rehearsing the usual equations (cf. especially Sec- 
tion EH] and eq.s 17.11 to 17. 5|) . that using the above assumptions, the Lagrangian and 
Hamiltonian mechanics of our system has been set up. So if S(qi,t) = o is a family of 
geodesically equidistant hypersurfaces associated with the system (each hypersurface 
n-dimensional) , the family covering our region G simply, then S satisfies the Hamilton- 
Jacobi equation in the form + E = 0. This can be immediately integrated to give, 
for some function S* of q{ only, 

S( qi ,t) = S*( qi )-Et. (8.1) 

(So the pi of the canonical field depend only on S*: Pi '■= ^ = ffr-) So the hypersur- 
faces of our family can be written as 

S*{ qi )=Et + a. (8.2) 

For any fixed t, a hypersurface of constant S, considered as a hypersurface in the 
configuration space Q (a hypersurface of dimension n — 1, i.e. co-dimension 1), e.g. the 
surface S(qi,t) = Oi, coincides with a hypersurface of constant S*: for this example, 
the surface S* = o\ + Et. But while the surfaces of constant S* are time-independent, 
the surfaces of constant S vary with time. So we can think of the surfaces of constant 
S as propagating through Q. With this picture in mind, let us calculate their velocity. 

(We can state the idea of surfaces in Q of constant S more rigorously, using our 
assumption that the region G C !R n+1 is of the form Q x T. This implies that any 
equation of constant time, t = constant, defines a n-dimensional submanifold of G 
which is a "copy" of Q; let us call it Q t . Each hypersurface in eq. 18.21 defines a 
(n — l)-dimensional submanifold of Q t (a hypersurface in Q t of co-dimension 1) given 
by 

S*(qi) = Et + constant, (with t = constant). (8.3) 

Then, as in the previous paragraph: fixing the constant a but letting t vary, and iden- 
tifying the different copies Q t of Q, we get a family of (n — l)-dimensional submanifolds 
of Q, parametrized by t. This can be regarded as a wave-front propagating over time 
through the configuration space Q.) 

9* 



Let us fix a constant o and a time t; let P = (qi) e Q be a point on the surface 
S = S* — Et = a; and consider the normal to this surface (pointing in the direction of 
propagation) at P. (So the ith component of the unit normal is i%{ =| VS 1 * .) 
Consider a point P' = (g« + (%) that lies a distance (is from P along this normal: (so 
dqi = riids). P' is on a subsequent wave-front (i.e. with the same value a of S, but not 
of S*) at time t + dt, where by eq. 18.21 

(99* 

dS* = —d qi = E dt; (8.4) 
dqi 

which, dividing by ds, yields 

— = I VS* I = = E -. (8.5) 

ds % dqi ds ds 

<99* 

Pi = -a- =*► p := | p | = | V5* | . (8.6) 
Combining these equations, eq. I8.5l and l%l)| we deduce that the speed u of the wavefront 

ds 
dt ■ 

E 

u = - . (8.7) 
V 



But we also have 



S = a, i.e. u := 4f, is 



So far, so classical. But now we postulate that the wave-fronts eq. 18.21 (or 
are surfaces of constant phase of a suitable time-dependent complex-valued function -0 
on Q. This postulate, together with some heuristic steps (including a judicious iden- 
tification of Planck's constant!), will give us a heuristic derivation of the Schrodinger 
equation. We will assume to begin with that we can write the postulated function 
if) = if>(qi,t) as 

if) = R(qi > t)exp[-2m(ut-<f>{q i ))] , (8.8) 

with R and <fi real; so that ut — <f> is the phase, and (apart from R's possible t- 
dependence) v is the frequency associated with ip. Then our postulate is that there is 
some constant h such that 

h(vt-<P(q t )) = (Et-S*(q t )) . (8.9) 

But this must hold for all q iy t, so that 

E = hv ■ S*{ qi ) = h<f>( qi ) . (8.10) 

So the postulated frequency is proportional to the system's energy. Then, using our 
previous calculation of the speed u, and the relation u = \v with A the wavelength, we 
deduce that the wavelength is inversely proportional to the magnitude of the system's 
momentum. That is: 

u = Xu=- => X = - . (8.11) 
p p 
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Substituting eq. 18.101 in eq. 18.81 we can write ip as 



if} = R(q h t)exp 



2 -f(S\ qi )-Et) 



R{qi,t) exp 



jriSTfa) - Et) 



(8.12) 



where we have defined h :- 



2vr' 



Assuming now that R has no (^-dependence, differentiation of eq. 18. 121 with respect 
to qi yields 

d± = idS^ 
dqi h dqi 

Recalling that = this is an eigenvalue equation, and suggests that we associate 
with the ith component of momentum p^ of a system whose R has no ^-dependence, 
the operator pi on wave-functions if> defined by 



h d 

Pi ■= --z- , * = 1, 

% dqi 



(8.14) 



Let us postulate this association also for ^-dependent R. Then this suggests we also 
associate with the energy of the system, the operator H on wave-functions defined by 



H := H(qi,pi,t) 



U5) 



(where we understand qi, and functions of it, as operating on wave-functions by ordi- 
nary multiplication). 

But assuming now that R has no t-dependence, differentiation of eq. 18.121 with 
respect to t yields 

ifr^t = Eijj (8.16) 

suggesting we should associate with the energy of a system, the operator E on wave- 
functions defined by 

d 

E:=ih— (8.17) 
dt v ; 

(By the way, this definition is also motivated by treating time as a coordinate along with 
the qf, cf. the discussion in Section Thus eq. 18.141 suggests we define p n +i '■= f J^; 
and when this is combined with eq. 18.171 we get 



p n+1 + E = 



(8.18) 



which is analogous to eq. 15.101 ) 

If for general R(qi, t) we endorse both these suggestions — i.e. we identify the actions 
on eq. 18. 121 of these two suggested operators , eq. 18.151 and 18.171 — then we get 



Hip = ih 



dip 
~dt 



(8.19) 



which, once we identify h as Planck's constant, is the Schrodinger equation. 
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9 A glance at the pilot-wave theory 



So much by way of sketching the Hamilton- Jacobi perspective on the heuristic route to 
wave mechanics. In this final Section, I will briefly return to this volume's question ' Quo 
vadis, quantum mechanics?', i.e. to the foundations of quantum theory. First, I want to 
stress that Hamilton- Jacobi theory remains an important ingredient in various research 
programmes in this field. Prominent among these is the trajectory representation of 
quantum mechanics, pioneered by Floyd, and Faraggi and Matone. I cannot go into 
details, but would recommend, as places to begin reading, both Floyd (2002) and 
Faraggi and Matone (2000). (Besides, Section 1 of the latter ends with some references 
to other research programmes that use Hamilton- Jacobi theory.) 

I shall instead end on Hamilton- Jacobi theory in the context of another prominent 
research programme (related to the trajectory representation): deBroglie's and Bohm's 
pilot-wave theory. Again, this is a large topic, and we only wish to advertise the recent 
work of Holland (2001, 2001a). 

First, we recall (Bohm (1952, p. 169), Bohm and Hiley (1993, p. 28), Holland 
(1993, pp. 69,134)) that: 

(i) : Writing ij) = R{q,i,t) exp(iS(qi,t)/h) (R, S real) in the one-particle Schrodinger 
equation, eq. 18. 191 with H := |j^V 2 + V gives 

| + _ W + fl + r = 0*fl,±Y, (9.1) 

which looks like the classical Hamilton- Jacobi equation (cf. eq. 12.23(1 of a particle in 
an external potential that is the sum of V and Q, which Bohm called the 'quantum po- 
tential'; indeed Bohm and Hiley call eq. 19 .11 the "quantum Hamilton- Jacobi equation"; 
and 

^ + — V ■ (pVS) = with p:=R 2 . (9.2) 
at m 

(ii) : These equations suggest the quantum system comprises both a wave, propagat- 
ing according to the Schrodinger equation, and a particle, which has (a) a continuous 
trajectory governed by the wave according to the guidance equation 

dqi dS 

m H = dq- ; O-s) 

and (b) a probability distribution given at all times by | ip | 2 = R 2 . 

Besides, comments and equations similar to those in (i) and (ii) apply when we 
insert ip — Rexp{iS/h) into the many-particle Schrodinger equation (Bohm (1952, p. 
174), Bohm and Hiley (1993, p. 56 et seq.), Holland (1993, pp. 277 et seq.)). 

So far, so good. But Holland (2001, p. 1044) points out that the relation of pilot- 
wave theory to classical Hamilton- Jacobi theory is not transparent. In particular, he 
points out: 

1): The guidance law eq. 19.31 is 'something of an enigma'. It looks like one half of 
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a canonical transformation that trivializes the motion of a classical system (by trans- 
forming to a set of phase space coordinates that are constant in time. But what about 
the other half; and more generally, can eq. 19.31 be somehow related to a Hamiltonian 
or Hamilton- Jacobi theory? 

2): Q's dependence on S (through eq. 19. 2|) means that the "quantum Hamilton- 
Jacobi equation" eq. 19.11 in effect contains higher derivatives of S — wholly unlike a 
classical Hamilton- Jacobi equation. 

So Holland undertakes an extensive investigation of this relation. More precisely, 
he undertakes to formulate the pilot-wave theory as a Hamiltonian theory. He does 
this by assessing a treatment of Q as a field function of on a par with the classical 
potential V; i.e. a treatment that takes as the Hamiltonian of the (one-particle) system 

H{qi, Pi ,t) = ^-E iPl 2 + Q(q u t) + V{q u t) (9.4) 

He emphasises that such a treatment faces three obstacles. In brief, they are: 

(a) : As we just mentioned in 2), Q depends on S and so presumably, by p = ||, 
on p. So in a Hamiltonian (phase space) treatment, it seems wrong to take Q as a 
function only of q. 

(b) : The free choice of initial positions and momenta in a Hamiltonian treatment 
will mean that most motions, projected on q, do not give the orthodox quantal distri- 
bution, in the way that eq. 19.31 and | ip \ 2 = R 2 does. 

(c) : Is such a treatment compatible with the Hamiltonian description of the Schrodinger 
equation? For it to be so, we have to somehow formulate the particle-wave interaction 

so as to prevent a back-reaction on the wave. 

However, Holland goes on to show (2001, 2001a) that these obstacles can be over- 
come. That is, he vindicates the proposal, eq. 19.41 with a Hamiltonian theory of the 
interacting wave-particle system. But we cannot enter details. It must suffice to list 
some features of his work. In short, his approach: 

(1) : generalizes a canonical treatment of a classical particle and associated ensem- 
ble; 

(2) : necessitates the introduction of an additional field of which the particle is the 
source; 

(3) : makes the "quantum Hamilton- Jacobi equation" and the continuity equation 
eq. 19.21 (and other equations for the evolution of particle and field variables) come out 
as Hamilton equations; 

(4) : interprets p = ^| as a constraint on the phase space coordinates of the wave- 
particle system; 

(5) : gives a general formula expressing the condition that the particle's phase space 
distribution, projected on q, gives the orthodox quantal distribution; and finally, 

(6) : yields a Hamilton- Jacobi theory of the wave-particle system. 

To conclude: I hope to have shown that Hamilton- Jacobi theory, understood from 
the perspective of the calculus of variations, gives us insight into both mechanics and 
optics — and that, as illustrated by this last Section, Hamilton- Jacobi theory is an 
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important ingredient in current attempts to answer the question, l Quo vadis, quantum 
mechanics?'. 
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